Cs599: Structure and Dynamics of Networked Information (spring 2005) 02/23/2005: Rank Aggregation Scribes: Ranjit Raveendran and Animesh Pathak

نویسندگان

  • Ranjit Raveendran
  • Animesh Pathak
چکیده

In previous lectures, we had discussed the problem of searching for relevant results on the WWW by exploiting the link structure. Nowadays, there are already multiple search engines giving quite good results. However, given that there are so many search engines already, employing different techniques, we may be interested in combining their positive features, and constructing a meta-search engine that uses all of their results. This leads very naturally to the problem of rank aggregation: given several orders on items (such as search results), determine a consensus ordering, which somehow reflects all orderings together. This problem can be studied from different perspectives. We could treat it as a machine learning problem, by treating each search engine as an “expert”, and trying to learn which engine’s advice to trust, based on past performance. The goal would then be to perform no worse than the best expert, but without knowing ahead of time which one is actually the best expert. Another approach, which we will explore in future lectures, is to consider it as an optimization problem. By defining an appropriate notion of distance between rankings, we can then look for a ranking that is close to all given rankings in a certain sense. A third approach, to be pursued in this and the following lecture, exploits the close connection between rank aggregation and voting. In both settings, we have items (candidates or web pages), and orderings on them (voter preferences or search engine rankings). The goal is to find a ranking that is “agreeable” to all voters/search engines. By using this analogy, we can leverage several centuries of thought on the issue of voting and social choice.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cs599: Structure and Dynamics of Networked Information (spring 2005) 04/25/2005: Analysis of Gossip-based Averaging Scribes: David Kempe 1 Averaging Using Gossip

In order to analyze this protocol, we will track what “share” of each node j’s number xj is currently contributing to each node’s sum si. So we look at a vector ~vi, in which the j component vi,j denotes the fraction of node j’s value that currently is part of node i’s sum. This means that initially, we have vi,i = 1 for all i, and vi,j = 0 for all i 6= j. In this view, our protocol can be expr...

متن کامل

CS599: Structure and Dynamics of Networked Information (Spring 2005) 02/02/2005: α-communities

Last time, we looked at a notion of community where we required high overall edge density among groups of nodes. This made no requirement on individual nodes. In particular, the definition favors the inclusion of high-degree nodes (in WWW terms: popular sites, e.g., yahoo.com, google.com, cnn.com). One may argue that communities should consist of nodes which predominantly belong to that communi...

متن کامل

Cs599: Structure and Dynamics of Networked Information (spring 2005) 02/16/2005: Uniform Labeling Problem Scribes: Yuriy Brun and Nupur Kothari 1 Uniform Labeling Problem via Lp Approximation

In the last lecture, we started studying the uniform labeling problem, where labels are to be assigned to nodes minimizing the combination of two cost objectives: a labeling cost for assigning a particular label at a particular node, and a separation cost for having different labels at adjacent nodes. The problem is known to be NP-hard for 3 or more labels, so we are looking for approximation a...

متن کامل

Cs599: Structure and Dynamics of Networked Information (spring 2005) 03/09/2005: Power Laws via Optimization Scribes: Ranjit Raveendran and Farnoush Banaei-kashani

In the past lecture, we investigated the preferential attachment model for the generation of power law degree distributions. While “rich-get-richer” models may be considered close to the truth for WWW-like graphs, they do not seem appropriate for graphs such as the Internet, as there appears no natural reason why nodes would choose to attach to high-degree nodes (or be more likely to find out a...

متن کامل

CS599: Structure and Dynamics of Networked Information (Spring 2005) 02/07/2005: The Search for Communities

We previously looked at communities as simply dense subgraphs graphs, or as subgraphs such that each node has a large fraction of its edges inside. Another view, taken in [3], starts from the hubs and authorities model. It argues that a structure of densely linked hubs and authorities is a common feature of communities. Such a core, a dense bipartite graph, can be considered the “signature” of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005